CDS

Accession Number TCMCG078C11301
gbkey CDS
Protein Id KAG0467511.1
Location join(29130721..29130777,29131505..29131582,29143152..29143193,29169366..29169398,29169479..29169572,29169652..29169684,29169758..29169904,29169989..29170119,29170197..29170334,29170421..29170519,29170599..29170892)
Organism Vanilla planifolia
locus_tag HPP92_019091

Protein

Length 381aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000009.1
Definition hypothetical protein HPP92_019091 [Vanilla planifolia]
Locus_tag HPP92_019091

EGGNOG-MAPPER Annotation

COG_category I
Description Serine aminopeptidase, S33
KEGG_TC -
KEGG_Module M00098        [VIEW IN KEGG]
KEGG_Reaction R01351        [VIEW IN KEGG]
KEGG_rclass RC00020        [VIEW IN KEGG]
RC00041        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
KEGG_ko ko:K01054        [VIEW IN KEGG]
EC 3.1.1.23        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00561        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko04714        [VIEW IN KEGG]
ko04723        [VIEW IN KEGG]
ko04923        [VIEW IN KEGG]
map00561        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map04714        [VIEW IN KEGG]
map04723        [VIEW IN KEGG]
map04923        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGATGGATCTAAATACCTTTAAAATTCTTGGGTTGATAGACCTTATCACATGCAAGACTTACTCCATGTTCAATATTATCGTCATCAACCTAGAGGATGAGGTTGCAATGCCCAAATCGTGGAAGAGTTCCAAAAATCATGATGAGATGCTTAAGACACTGAGAACAATTCTATGGGCGGTGGTCACTGGAAATGTTAAGTTTGAAGAGGAGTTCATTCTGAATTATCGAGGAGTGAAACTGTATACCTGCCGGTGGACACCGGCGAACCGAAACCCCAAAGCTTTGGTCTTCCTTTGTCATGGATATGCCATGGAGTGTAGCATCTCAATGAGAGTCACAGCGATTAGATTGACAGAGGCCGGGTTTACTGTCTACGGGATGGACTATGAAGGCCATGGCAAGTCCTCTGGCTTGCAGGGCTACATCCCTAACTTTGATGACCTTGTGAACGATTGCTCCGAATATTATACTTCTGTTTGTGAGAGGAAAGAGAACAAGGATAAGGTGAGGTTTCTGCTTGGTGAATCCATGGGAGGTGCCGTGGCCCTTCTCTTGCATAGGAAGAAGCCGGTCTTTTGGAATGGAGCTGTTCTAGTTGCTCCAATGTGCAAGATTGCTGAAGAGATGAAACCTCATCCATTGGTCATCAACATGCTAACGAAACTTTGTAGAGTTATTCCAACGTGGAAGATTGTTCCTTGTAAGGAAGTCATCGATAGCGCCTTCAAGAGTCCAGAATGGAGAGAAGAGATTCGAAACAATCCTCATTGCTACAAGGGTAAGCCTCGCCTAAAGACTGGTTATGAACTTCTTATGGTGAGCATGGATATTGAAAAGAACTTGAATCAAGTATCATTACCTTTCATCATCATTCATGGTGGTGAAGACATCGTCACGGATCCCTCAGCGAGCCAAGCTCTCTACGAAACGTCAAAGAGTGAAGACAAGACCTTTAAGCTCTACCCTGGGATGTGGCATGCCCTGACATCCGGTGAGCCGCAGGAAAACATCGATCTTGTTTTCTCGGACATCATTTCATGGCTCGATGACCGGGCGATAACGATGAGCTCAAGATTGGAGATGCAGAAGAAGGCCGAACATGACACACAAGTACTGTTTGAAGAATCATTCAAGAAAGCATAA
Protein:  
MMDLNTFKILGLIDLITCKTYSMFNIIVINLEDEVAMPKSWKSSKNHDEMLKTLRTILWAVVTGNVKFEEEFILNYRGVKLYTCRWTPANRNPKALVFLCHGYAMECSISMRVTAIRLTEAGFTVYGMDYEGHGKSSGLQGYIPNFDDLVNDCSEYYTSVCERKENKDKVRFLLGESMGGAVALLLHRKKPVFWNGAVLVAPMCKIAEEMKPHPLVINMLTKLCRVIPTWKIVPCKEVIDSAFKSPEWREEIRNNPHCYKGKPRLKTGYELLMVSMDIEKNLNQVSLPFIIIHGGEDIVTDPSASQALYETSKSEDKTFKLYPGMWHALTSGEPQENIDLVFSDIISWLDDRAITMSSRLEMQKKAEHDTQVLFEESFKKA